Information-theoretic temporal Bell inequality and quantum computation 
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An information-theoretic temporal Bell inequality is formulated to contrast classical and quantum 
computations. Any classical algorithm satisfies the inequality, while quantum ones can violate it. 
Therefore, the violation of the inequality is an immediate consequence of the quantumness in the 
computation. Furthermore, this approach suggests a notion of temporal nonlocality in quantum 
computation. 
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I. INTRODUCTION 

Separation between classical and quantum phenomena 
plays a crucial role in understanding the weirdness of 
quantum physics and has recently been extensively inves- 
tigated, especially in the context of quantum information 
science. One way of drawing a rigid distinction between 
them is to show the violation of the Bell inequality pj . 
In this paper, we develop a Bell-type inequality that dis- 
tinguishes between classical and quantum computations. 
Here, in order to allow arguments based on conditional 
entropy between measurement outcomes of two differ- 
ent observables, we employ the information-theoretic Bell 
inequalities formulated by Braunstein and Caves 0, 01 • 
Furthermore, instead of discussing correlations between 
observables of distantly located physical systems, we fo- 
cus on one and the same physical system and analyze 
correlations between measurement outcomes at different 
times as in the argument on the temporal Bell inequality 
initiated by Leggett and Garg . 

The information-theoretic temporal Bell inequality is 
formulated for classical algorithms. It is satisfied by any 
classical algorithm but can be violated by quantum ones. 
As an example to show the violation of the inequality, we 
discuss the so-called "database" search problem, which 
is formally stated as follows: Given a black box (oracle) 
that calculates the function F of x € {0, . . . , 2™ — 1} such 
that, for unknown s, 




F(x) = 



find the unknown number s by asking the oracle as few 
times as possible. Note that we do not assume any struc- 
ture within the black box, and thus all we can do is just 
input an n-bit number and obtain an output. (Tricks 
such as requesting some portion of n bits, e.g., whether 
the first bit of the n bits is or 1, are not allowed in the 
present setting.) It is well-known that it takes 0(2 n ) or- 
acle queries in classical algorithms to solve the problem, 
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while it requires only 0{y2 n ) queries in Grover's algo- 
rithm for quantum computation [5| • It is shown that the 
information-theoretic temporal Bell inequality is violated 
in Grover's algorithm. 

Bell-type inequalities are conventionally invoked to 
find out whether some profound physical principle such as 
local realism [l| or macroscopic realism j4| holds in quan- 
tum theory. However, the information-theoretic tempo- 
ral Bell inequality is proposed as just one way of discrim- 
inating between classical and quantum computations. It 
is not intended here to try to refute nor confirm any phys- 
ical principle. 

Another motivation behind this work is to advance 
our understanding of the role of temporal correlations 
in quantum information processing. Temporal correla- 
tions in quantum phenomena have been studied in the 
foundations of quantum theory from the viewpoint of 
Bell-type inequalities since the original proposal of the 
Leggett-Garg inequality 0,0- In the context of infor- 
mation processing, it was rephrased quite recently from 
a slightly different perspective 0, Hi- The present ap- 
proach is developed in the belief that, in some cases, the 
power of quantum information processing may also lie in 
temporal correlations of a truly quantum nature as well 
as in spatial correlations due to entanglement. 

An information-theoretic aspect of the "database" 
search problem has already been studied in |j| , where the 
maximum amount of information obtainable from a sin- 
gle oracle query is analyzed. The present scenario, on the 
other hand, mainly focuses on the entire process of the 
algorithms and thus rather discusses collective behavior 
of many successive oracle queries in terms of information- 
theoretic quantities. 

This paper is organized as follows. First, the 
information-theoretic Bell inequalities derived in |2| are 
briefly revisited in Sec.^ Next, we move to the formula- 
tion of an information-theoretic temporal Bell inequality 
in Sec. IIIII Then, Sec. IIVI gives a proof of the violation 
of the inequality in quantum computation. In Scc.0 we 
discuss some implications of the inequality, especially a 
notion of temporal nonlocality suggested by the violation 
of the inequality, and finally summarize the paper. 
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II. INFORMATION-THEORETIC BELL 
INEQUALITIES REVISITED 

From a mathematical point of view, the existence of 
a valid joint probability distribution of the measurement 
outcomes of all relevant observables is essential to Bell 
inequalities 1 ; any classical theory can assign definite val- 
ues to all observables irrespective of measurement pro- 
cesses in principle and any marginal probability distri- 
bution must be consistent with the entire joint proba- 
bility distribution, while this is not generally the case 
in quantum theory. The information-theoretic version 
of Bell inequalities is a p araphrase of this fact in terms 
of Shannon entropy [l(Tj . which is defined as H(X) — 
— ^2 x p(x) log 2 p(x) for an observable X with {p(x)} be- 
ing the probability distribution of measurement outcomes 
x. Since various inequalities between entropies in in- 
formation theory are derived for valid joint probability 
distributions, the inequalities might not be satisfied for 
invalid joint probability distributions generated in quan- 
tum theory. This is the crux of the information-theoretic 
Bell inequalities. In fact, they are derived by applying ba- 
sic inequalities in information theory to valid joint prob- 
ability distributions in classical theory as shown below. 

Consider, say, four observables A , A\, A 2 , A 3 and 
their joint entropy H(Aq, A\, A 2 , A3). The chain rule 
for conditional entropies yields H (A , A\, A 2 , A3) = 
H(A 3 \A , A U A 2 ) + H(A 2 \A , A ± ) + H(A 1 \A ) + H(A ). 
(For details of basic inequalities in information theory, 
see (II) . f° r example.) Since conditioning never increases 
entropy, e.g., H(A 2 \A ,A 1 ) < H(A 2 \A 1 ), we have 

H{A^A U A 2 ,A 3 ) < H{A a \A 2 )+H(A 2 \A 1 ) 

+ H(A 1 \Ao)+H(A Q ). (1) 

Our information-theoretic temporal Bell inequality is 
based on this form, but before moving on to it, it is 
worthwhile mentioning how Eq. (JTJ is related to the well- 
known argument on the (spatial) Bell inequality between 
two distantly located parties (Alice and Bob) with two 
qubits 0| . Suppose A$ and A 2 (with even subscripts) 
represent Alice's observables and A\ and A3 (with odd 
subscripts) Bob's. The information contained in some 
quantities is never smaller than that in a subset of them, 
i.e., H(A ,A 3 ) < H(A ,A 1 ,A 2 ,A 3 ), thus we have 

H{A 3 \A ) < H(A 3 \A 2 ) + H{A 2 \A\) + H{A X \A ), (2) 

which is one of the information theoretic (spatial) Bell 
inequalities given in 0. The conditional entropy be- 
tween Alice's and Bob's local observables satisfies Eq. 
classically, because there exists a valid joint probability 



Prom a physical point of view, needless to say, it is essential for 
the conventional (spatial) Bell inequalities that we be able to 
draw the conclusion that quantum theory cannot be described 
by any local hidden variable theory. 



distribution of all the observables. However, it can be 
proved to be violated in quantum theory, although we will 
not go into the details of the spatial case here. Roughly 
speaking, the existence of a (possibly invalid) joint prob- 
ability distribution over all the observables corresponds 
to the assumption of realism, and the consistency of all 
the marginal probability distributions corresponds to the 
assumption of locality in the spatial argument. [Schu- 
macher also proved inequalities similar to Eq. (J2J and 
described an intuitive geometric picture of violating the 
inequality [l^. Cerf and Adami explored entropic Bell 
inequalities by considering entropy Venn diagrams 14].] 
The form of Eq. is suitable for our argument on a 
temporal version and can be easily generalized to the one 
with L + 1 observables, 

H(A ,...,A L ) < H(A L \A L _ 1 ) + --- + H(A 1 \A ) 

+ H(A ). (3) 

In the next section, we switch to an argument on tem- 
poral correlations by choosing observables Ai as those 
of one and the same physical system at different times 
from to to ii, which correspond to L different computa- 
tional steps in our scenario. This inequality also should 
be satisfied when there exists a valid joint probability dis- 
tribution for all observables, although it may be violated 
in time evolution governed by quantum theory 



III. INFORMATION-THEORETIC TEMPORAL 
BELL INEQUALITY 

Next, we derive an information-theoretic temporal Bell 
inequality by applying Eq. © to the "database" search 
problem mentioned above. Consider an algorithm solv- 
ing the problem deterministically with L oracle queries. 
Suppose there exists a bit (or a qubit) in which the out- 
put of the oracle query is stored, and the observables 
Ai (i = 0, . . . , L) correspond to the two-valued (0 or 1) 
measurement on the (qu)bit after the ith oracle query. 
(For a reason stated below, Aq is defined to have the 
fixed initial value of the output register "0.") Hence, at 
least classically, by measuring all observables Ai, the so- 
lution to the problem can be found. For instance, in the 
most straightforward classical algorithm, we input the 
numbers from zero to 2™ — 1 to the oracle successively 
and read each output of the oracle as a measurement 
outcome of A.- L . This process requires at most L = 2™ 
queries and yields the solution with probability 1. The 
aim of this problem is to find the unique item x = s out 
of 2™ equally likely numbers; hence, we eventually obtain 
n bits of information by solving the problem. Further- 
more, the amount of information obtained by measuring 
the outputs of the oracle cannot be less than the amount 
of information that is supposed to be obtained by solving 
the problem itself. Therefore, we have 

n<H(A ,...,A L ). (4) 
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If it were possible to infer the solution by observing ran- 
dom variables containing less information than the solu- 
tion itself, then it would be a contradiction. [Note that 
H(Aq, . . . , Al) might be smaller than n bits in unsuc- 
cessful algorithms, but here we concentrate only on de- 
terministic algorithms solving the problem successfully] 
By combining Eqs. © and |@J, we obtain the 
information-theoretic temporal Bell inequality for the 
present problem, 

n<H(A L \A L _ 1 ) + ---+H(A 1 \A ). (5) 

[Note that the last term on the right-hand side of Eq. 
H(A ), is omitted because A is defined to have the fixed 
value "0" so that this unwanted term always vanishes.] 
From a classical point of view, this inequality means that 
the sum of the conditional entropy between successive 
oracle queries is sufficient to constitute the amount of 
information contained in the solution. [In fact, it can 
be proven that the inequality in Eq. JSJ holds, for in- 
stance, in the most straightforward classical algorithm 
given above by an explicit calculation.] In other words, 
suppose there are many identical copies of a classical 
computer running the algorithm separately, but only the 
outputs of two randomly chosen successive oracle queries 
are measured and the relevant conditional entropy is cal- 
culated independently; for the first computer, only A% 
and A3 are measured, and for the second one, only A\§ 
and An, and so on. Then, accumulating all the small 
pieces of "progress" achieved in different computers, we 
can infer the solution to the problem. This picture makes 
it easy to draw an analogy with ordinary experiments on 
spatial Bell inequalities: First, prepare many identical 
copies of an entangled pair and then measure randomly 
chosen observables. In our scenario, the timing of mea- 
suring the outputs of the oracle is randomly chosen in- 
stead in the manner of Leggett and Garg. 2 The point 
is that outputs of the oracle, i.e., observables Ai have 
predetermined values and that there exists a valid joint 
probability distribution of them in any classical compu- 
tation. 



IV. VIOLATION IN QUANTUM 
COMPUTATION 

As we have seen in the previous section, the 
information-theoretic temporal Bell inequality (JSJ) holds 
in any classical algorithm. However, this is not the 
case for quantum computation. In this section, we show 
that the inequality is actually violated in Grover's al- 
gorithm p|. For details of Grover's algorithm, see [l5| . 
Let the initial state of a quantum computer be = 



2 In the original Leggett-Garg inequality, we randomly choose two 
different times, say t\ and t3, out of four possibilities, fi, . . . , <4, 
however. 



~^2^J2'j=o 1 b')*|0)o) where the first and second registers 
denoted as | }i and | ) represent the input and the out- 
put of the oracle, respectively. The output register is 
usually defined as -i|(|0) o — |1) ) instead of |0) o so that 
the phase flip operation is automatically performed and 
the number of oracle queries is reduced by half. Nev- 
ertheless, we stick to |0) o because we need to measure 
the outputs of oracle queries in the middle of the com- 
putation. Hence, the Grover iteration is represented as 
G = (2\ip)(4>\ — I)Oa z O, where the first term represents 
the "inversion about average" operation, O is the oracle 
operator defined as 0\x) i\y) = \x) i\y (B F (x)) , and <t z is 
the Pauli matrix acting only on the output register | ) D . 
[The second oracle operation in G is inevitable in order 
to erase (undo) the previous oracle operation and disen- 
tangle the output register from the input one so that the 
interference due to the "inversion about average" opera- 
tion works properly.] When we measure the outputs of 
oracle queries (as is explained in the next paragraph), 
the output register | ) D is measured in the computational 
basis {|0), |1)} right after the first oracle operation O in 
the relevant Grover iteration. Note that this measure- 
ment is not included in the original version of Grover's 
algorithm. It is introduced here to carry out the scenario 
showing the violation of our inequality. 

An "experiment" to test our inequality goes as follows, 
which certainly parallels the classical argument. First, we 
prepare many identical copies of a quantum computer, 
and then run Grover's algorithm in each computer in- 
dependently. Then, we randomly choose two computa- 
tional steps k and k + 1 out of 0(s/2~™) possibilities for 
each computer and measure the output register | ) Q only 
at the fcth and (k + l)th Grover iterations. (We mea- 
sure it right after the first oracle operations O in each 
Grover iteration G.) After the first measurement on the 
output register, the state of the input register jumps into 
either the state we are looking for, \s)i, or the (2 n — 1) di- 
mensional subspace orthogonal to |s)i, which we denote 
as \a)i — ^ 2 t_ 1 Tlj^s I n either case, the quantum 
computer continues the rest of the Grover iteration, i.e., 
(2\ip)(ip\ — I)Oa z , and thus the state becomes a superpo- 
sition of \a)i and \s)i again. We carry out these proce- 
dures in all the quantum computers independently with 
random choices of k > 0. 3 

By an explicit calculation, for k > 1, we have 



For an exact correspondence to the classical scenario, we should 
not continue the Grover iteration when we luckily find the so- 
lution s in the first measurement of the output, i.e., in the 
fcth Grover iteration. The succeeding dynamics should be re- 
placed by a trivial one to make the value of the output register 
"0" thereafter. However, it is easily shown that the quantity 
H(A k+ i\A k ) in this scenario is always smaller than that calcu- 
lated in the present scenario. That is, if the inequality is violated 
in our scenario, then it is violated in the above exact one as well. 
Therefore, we choose the one developed in the text to make the 
following argument simpler. 
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H(Ak+i\Ak) = £f(cos 2 #), where 9 is denned as cos | = 

yjl — and H(x) = — a;log 2 x — (1 — x) log 2 (l — x) is 

the binary entropy. We also have H(A\ \Aq) — iJ(cos 2 |). 
In what follows, we prove that inequality (jSJ) with L = 
0(V2™) is violated. First, since sin 2 9 = — 2 2 ™- 2 > 
we have sin 2 9 < <\{n> 3). Thus, H(cos 2 9) = 

H(sin 2 9) < H(j^). Defining f(x) = —x\og 2 x, the in- 
equality f(x) > /(l — x) holds for < x < \. Hence, 
we have H(^) = f(^) + /(l - ^) < 2/(^) = 
2 n\ ■ Furthermore, we clearly have iJ(cos 2 |) < 1. By 
combining these inequalities, we can estimate the right- 
hand side of inequality (JSJ) for L = \/2™ as follows: 

H ( A V2*\ A V2*-i) + ■■■ + H{A 2 \A 1 ) + H{Ax\Ao) 
= (V¥ - l)H (cos 2 9) + H ^cos 2 

, / — ,n- 2 
< (V2"-l)- — 

n - 2 n - 2 

- 2 («/2)-3 ~ 2 « -3 ^ 

Therefore, for large ri and L = 0(^/2^), we obtain 

H(A L \A L _ 1 ) + --- + H(A 1 \A ) <n. (7) 

This violation of the inequality is an immediate conse- 
quence of the quantumness in the computation. Note 
that the present inequality is defined only for algorithms 
solving the problem successfully. Thus possible violations 
of the inequality in unsuccessful classical algorithms are 
not necessarily due to the quantum nature of the compu- 
tation. It is no wonder that the inequality can be violated 
by unsuccessful classical algorithms because they fail in 
accumulating a sufficient amount of information to reach 
the solution. 4 



V. DISCUSSION 

In this section, we discuss some implications of this 
approach. First, we deal with some possible objections 
to it, and then discuss a notion of temporal nonlocality 
suggested by the violation of our inequality. 

There may be an objection that unnecessarily inter- 
rupting the quantum algorithm by measurement obvi- 
ously precludes the computation and seems pointless. 
However, the whole point of this paper is not to provide 
any useful techniques to improve quantum algorithms but 



to offer a novel way of characterizing quantum computa- 
tion. In fact, in our argument, many identical copies of 
a quantum computer are required and are not fully ex- 
ploited. One copy of the quantum computer would be 
sufficient to achieve fast quantum computation. Never- 
theless, this does not necessarily mean that the present 
scenario is useless, which can be seen in analogy to the 
well-known argument on spatial Bell inequality experi- 
ments. We consume many copies of an entangled pair to 
demonstrate the nonlocal character of the pair, while we 
need only one copy to accomplish quantum teleportation 
|16| or superdense coding p"?| . Similarly, we require many 
copies of a quantum computer and spoil their computa- 
tional processes to demonstrate the quantumness in the 
computation, while we need only one copy to accomplish 
fast quantum computation. Therefore, in spite of the fact 
that it seems quite indirect and inefficient, the present 
approach over numerous computers seems promising as 
a means of fully appreciating the essential difference be- 
tween classical and quantum computations from a phys- 
ical point of view. 5 

The number of terms in inequality JSJ is different 
in classical and quantum algorithms; classically, it has 
0(2") terms, while quantumly, 0(-\/2™) terms. Thus, it 
might not be surprising that the inequality is violated in 
quantum computation. The point is, however, whether 
we can assign definite values to the outputs of all the 
oracle queries and there exists a valid joint probability 
distribution over all the outputs. Detecting the violation 
of the inequality is considered as one way of clarifying 
this point. In a conventional view, we usually think that 
a quantum computer is in a superposition and thus that 
there exists no definite value of registers in each com- 
putational step. This view underlies the standard inter- 
pretation of quantum computation: parallel computation 
due to quantum superposition. 

However, what if we try to interpret Grover's algorithm 
in a realistic way with a (global) hidden variable model? 
Every input and output of oracle queries should have a 
predetermined value in this view. Nevertheless, the al- 
gorithm still succeeds in finding the solution in 0(\/2™) 
steps. This seems quite bizarre but is possible once we 
accept this perspective. The trick is that the quantum 
computer somehow selects promising inputs to the or- 
acle without exhausting all the possibilities. Strangely, 
the solution x ~ s is always included in those promis- 
ing 0(-\/2™) inputs. It looks as if the quantum computer 
partially knew which one would fail if it were fed into 
the oracle. Classically, it is natural to keep excluding the 
previously used inputs until we hit the solution. How- 
ever, the quantum computer could refer to the outcomes 
of the oracle queries corresponding to previously unused 



4 This is reminiscent of the condition in standard spatial Bell in- 
equality experiments that local measurements performed by two 
distantly located parties, Alice and Bob, should be spacelike sep- 
arated. If they were allowed to communicate and organize their 
measurement settings, they could violate Bell inequalities even 
within classical theories. 



5 Incidentally, I believe that quite a few physicists would be inter- 
ested in the principle underlying qualitative differences between 
classical and quantum computations, even if there were no com- 
putational speed-up in quantum computation. 
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inputs. In other words, the quantum algorithm seems 
to be able to reach the solution without accumulating 
a sufficient amount of information from a classical point 
of view. Classically, the solution can be found only by 
accumulating a sufficient amount of information step by 
step through the process of excluding unsuccessful in- 
puts. Our inequality (J5J and its violation in Eq. iJJJ offer 
one way of expressing these facts mathematically. 

One possible explanation of the above counterintuitive 
interpretation is that the quantum computer can invoke 
temporal nonlocality to select promising inputs without 
exhausting all possibilities. It looks as if the queries per- 
formed by the quantum computer were connected in a 
nonlocal way in time. Therefore, it is fair to say that 
the violation of our inequality in quantum computation 
represents the temporal nonlocality. As the violation of 
conventional (spatial) Bell inequalities implies that quan- 
tum theory is intrinsically nonlocal, the violation of the 
information-theoretic temporal inequality suggests the 
notion of temporal nonlocality in quantum information 
processing. 

Spatial correlations due to entanglement between 
many qubits are considered to be an origin of the power 
of quantum computation in the standard view of quan- 
tum parallelism. However, the above interpretation fo- 
cuses on temporal correlations between different compu- 
tational steps and succeeds in revealing another weird 
aspect of quantum computation. Parallel computation 
due to quantum superposition is counterintuitive, but ex- 
ploiting seemingly unperformed computation is far more 
so. These two different but not conflicting perspectives 
are based on spatial and temporal approaches, respec- 
tively. 

The information-theoretic temporal Bell inequality is 
formulated only for the so-called "database" search prob- 
lem here, but it can be generalized to other problems in 
principle. Suppose the amount of information obtained 
by solving the problem is / bits. (This can be easily de- 
termined as n bits in the unstructured database search 
problem discussed above, although determining it is gen- 



erally a nontrivial task.) Then, in a similar way to de- 
riving Eq. we have 

I<H(A ,...,A L ), (8) 

where Aq,..,,Al are some observables from which the 
solution to the problem is inferred . By combining this 
inequality with Eq. (J3J, we have 

I <H(A L \A L _ 1 ) + --- + H(A 1 \Ao)- (9) 

[The last term on the right-hand side of Eq. J3J|, H (Ao), 
can be omitted as is done in the derivation of Eq. (JSJ).] 
This also should be satisfied by any classical algorithm 
deterministically solving the problem and can be violated 
by quantum algorithms, if any. It remains open to apply 
the inequality to other quantum algorithms and obtain 
further insights into essential features of quantum com- 
putation. 

VI. SUMMARY 

An information-theoretic temporal Bell inequality was 
formulated for the so-called "database" search problem. 
Provided that the problem is successfully solved with cer- 
tainty, any classical algorithm satisfies the inequality, but 
Grover's algorithm for quantum computation violates it. 
Separating classical computations from quantum ones by 
means of a Bell-type inequality suggests a novel way to 
contrast them, which is similar to analyzing nonlocal ef- 
fects due to entanglement. Furthermore, a notion of tem- 
poral nonlocality was suggested by the violation of the 
inequality. I hope the present approach will eventually 
contribute to shed new light on the nature of quantum 
computation and temporal correlations in quantum in- 
formation processing in general. 
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